Search results for "robust statistics"
showing 10 items of 15 documents
Experimental and robust modeling approach for lead(II) uptake by alginate gel beads: influence of the ionic strength and medium composition.
2014
Abstract Systematic kinetic and equilibrium studies on the lead ions removal ability by Ca-alginate gel beads have been performed by varying several internal parameters, namely, number of gel beads, nature and composition of the ionic medium and pH, which allowed us to model a wastewater in order to closely reproduce the composition of a real sample. Moreover, the effects brought about the different ionic species present in the reacting medium have been evaluated. Differential Pulse Anodic Stripping Voltammetry (DP-ASV), has been systematically used to perform kinetic and equilibrium measurements over continuous time in a wide range of concentration. Kinetic and equilibrium data have been q…
How Universal Is the Relationship between Remotely Sensed Vegetation Indices and Crop Leaf Area Index? A Global Assessment
2016
This study aims to assess the relationship between Leaf Area Index (LAI) and remotely sensed Vegetation Indices (VIs) for major crops, based on a globally explicit dataset of in situ LAI measurements over a significant set of locations. We used a total of 1394 LAI measurements from 29 sites spanning 4 continents and covering 15 crop types with corresponding Landsat satellite images. Best-fit functions for the LAI-VI relationships were generated and assessed in terms of crop type, vegetation index, level of radiometric/atmospheric processing, method of LAI measurement, as well as the time difference between LAI measurements and satellite overpass. These global LAI-VI relationships were evalu…
An empirical test of marginal productivity theory
2014
We explore an hitherto unused approach to testing marginal productivity theory. Our method rests on the simple idea that, under the assumption of a linear homogeneous production function, residual profits are informative about the discrepancies between factor payments and marginal products. Our empirical application using data on manufacturing plants in Chile suggest moderate deviations from marginal productivity theory which depend on firm size.
Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering
2017
Clustering is an unsupervised machine learning and pattern recognition method. In general, in addition to revealing hidden groups of similar observations and clusters, their number needs to be determined. Internal clustering validation indices estimate this number without any external information. The purpose of this article is to evaluate, empirically, characteristics of a representative set of internal clustering validation indices with many datasets. The prototype-based clustering framework includes multiple, classical and robust, statistical estimates of cluster location so that the overall setting of the paper is novel. General observations on the quality of validation indices and on t…
Robust Principal Component Analysis of Data with Missing Values
2015
Principal component analysis is one of the most popular machine learning and data mining techniques. Having its origins in statistics, principal component analysis is used in numerous applications. However, there seems to be not much systematic testing and assessment of principal component analysis for cases with erroneous and incomplete data. The purpose of this article is to propose multiple robust approaches for carrying out principal component analysis and, especially, to estimate the relative importances of the principal components to explain the data variability. Computational experiments are first focused on carefully designed simulated tests where the ground truth is known and can b…
Empirical Likelihood-Based ANOVA for Trimmed Means
2016
In this paper, we introduce an alternative to Yuen’s test for the comparison of several population trimmed means. This nonparametric ANOVA type test is based on the empirical likelihood (EL) approach and extends the results for one population trimmed mean from Qin and Tsao (2002). The results of our simulation study indicate that for skewed distributions, with and without variance heterogeneity, Yuen’s test performs better than the new EL ANOVA test for trimmed means with respect to control over the probability of a type I error. This finding is in contrast with our simulation results for the comparison of means, where the EL ANOVA test for means performs better than Welch’s heteroscedastic…
Improvement in fast particle track reconstruction with robust statistics
2014
The IceCube project has transformed one cubic kilometer of deep natural Antarctic ice into a Cherenkov detector. Muon neutrinos are detected and their direction inferred by mapping the light produced by the secondary muon track inside the volume instrumented with photomultipliers. Reconstructing the muon track from the observed light is challenging due to noise, light scattering in the ice medium, and the possibility of simultaneously having multiple muons inside the detector, resulting from the large flux of cosmic ray muons. This manuscript describes work on two problems: (1) the track reconstruction problem, in which, given a set of observations, the goal is to recover the track of a muo…
A vision-based fully automated approach to robust image cropping detection
2020
Abstract The definition of valid and robust methodologies for assessing the authenticity of digital information is nowadays critical to contrast social manipulation through the media. A key research topic in multimedia forensics is the development of methods for detecting tampered content in large image collections without any human intervention. This paper introduces AMARCORD (Automatic Manhattan-scene AsymmetRically CrOpped imageRy Detector), a fully automated detector for exposing evidences of asymmetrical image cropping on Manhattan-World scenes. The proposed solution estimates and exploits the camera principal point, i.e., a physical feature extracted directly from the image content th…
Robust estimation and inference for bivariate line-fitting in allometry.
2011
In allometry, bivariate techniques related to principal component analysis are often used in place of linear regression, and primary interest is in making inferences about the slope. We demonstrate that the current inferential methods are not robust to bivariate contamination, and consider four robust alternatives to the current methods -- a novel sandwich estimator approach, using robust covariance matrices derived via an influence function approach, Huber's M-estimator and the fast-and-robust bootstrap. Simulations demonstrate that Huber's M-estimators are highly efficient and robust against bivariate contamination, and when combined with the fast-and-robust bootstrap, we can make accurat…
Sample-size calculation and reestimation for a semiparametric analysis of recurrent event data taking robust standard errors into account
2014
In some clinical trials, the repeated occurrence of the same type of event is of primary interest and the Andersen-Gill model has been proposed to analyze recurrent event data. Existing methods to determine the required sample size for an Andersen-Gill analysis rely on the strong assumption that all heterogeneity in the individuals' risk to experience events can be explained by known covariates. In practice, however, this assumption might be violated due to unknown or unmeasured covariates affecting the time to events. In these situations, the use of a robust variance estimate in calculating the test statistic is highly recommended to assure the type I error rate, but this will in turn decr…